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DUAL  TASK  PERFORMANCE  WITH  LPC  DEGRADED  SPEECH 
IN  A  SENTENCE  VERIFICATION  TASK 


INTRODUCTION 

There  are  many  voice  communications  situations  in  which  the  communicator  is 
performing  some  other  task  in  addition  to  listening  to  and  understanding  the  speech,  for 
example,  flying  an  airplane  or  driving  a  vehicle.  Often  these  situations  are  also 
circumstances  in  which  speech  intelligibility  can  be  expected  to  be  more  or  less  degraded 
due  to  the  transmission  channel  or  to  background  noise.  Small  losses  in  intelligibility  have 
little  effect  on  whether  ordinary  speech  is  comprehended,  and  with  the  more  distinctive  and 
highly  constrained  vocabularies  used  in  military  speech,  more  degradation  can  be  tolerated. 
However,  even  if  the  message  is  correctly  understood,  greater  mental  effort  may  be 
required  by  the  listener  to  understand  the  speech.  With  very  severe  degradations,  not  only 
is  listener  effort  further  increased,  but  errors  in  comprehension  will  occur.  Most  research 
on  the  effects  of  speech  degradation  on  intelligibility  and  other  performance  measures  such 
as  reaction  time  (RT)  has  been  conducted  with  the  speech  task  as  the  only  task.  This  paper 
reports  the  results  of  a  preliminary  study  on  the  effects  of  reduced  speech  intelligibility  on 
dual  task  performance. 

The  ability  to  perform  two  tasks  simultaneously  depends  on  the  amount  and 
efficiency  of  the  mental  resources  allocated  to  the  two  tasks  (e.g.,  Navon  &  Gopher,  1979, 
Norman  &  Bobrow,  1975).  According  to  multiple  resource  theory,  (e.g.,  Wickens, 
1984),  the  degree  to  which  two  tasks  will  interfere  with  one  another  depends  both  on  the 
resource  demands  of  the  two  tasks  and  on  the  resource  composition  of  the  tasks.  If  the 
tasks  draw  on  different  resources,  this  should  result  in  less  interference  than  if  they  draw 
on  the  same  resources,  although  recent  evidence  (Wickens  and  Liu,  1988)  suggests  that 
there  may  also  be  a  cost  associated  with  scanning  between  different  modalities.  Regardless 
of  which  version  of  resource  theory  we  accept,  we  usually  expect  some  decrement  in 
performance  when  two  tasks  are  performed  simultaneously,  and  it  is  what  happens  when 
the  difficulty  of  the  speech  task  is  increased  that  is  of  particular  interest  here. 

We  selected  a  visual  picture  sorting  task  as  the  second  task.  In  many  real  situations 
where  degraded  voice  communication  occurs,  the  concurrent  task  demands  are  likely  to  be 
visual-spatial  in  nature.  According  to  multiple  resource  theory,  less  competition  can  be 
expected  between  a  visual  and  a  verbal  task  than  between  two  verbal  tasks.  It  is  probable 
that  some  overall  decrement  would  occur  for  both  tasks  due  to  dual-task  demands.  If  the 
mental  resources  required  to  process  the  speech  task  are  separate  from  those  needed  for  the 
visual  task,  the  more  difficult  degraded  speech  conditions  should  affect  performance  only 
on  the  speech  task.  If  there  is  also  an  additional  decrement  on  the  visual  task  due  to 
increasing  the  speech  difficulty,  this  would  suggest  that  both  tasks  are  drawing  on  shared 
resources  and  that  the  extra  effort  required  for  the  speech  task  interferes  with  the  processing 
of  the  visual  task  as  well. 

The  speech  task  was  a  sentence  verification  task  (Larkey  and  Danly,  1983),  and  the 
speech  degradation  was  accomplished  using  a  narrowband  digital  voice  transmission 
system  operating  with  and  without  random  bit  errors.  Narrowband  digital  voice 
transmission  systems  for  secure  voice  applications,  both  commercial  and  military,  are 
coming  into  increasingly  widespread  use.  A  linear  predictive  coding  (LPC)  algorithm 
operating  at  2.4  kbps  has  been  established  as  the  DoD  standard  (MIL-STD- 199-1 13  or 
Federal  Standard  1015).  Enhanced  versions  of  this  algorithm  (LPC-lOe)  have  been 
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incorporated  in  the  Subscriber  Terminal  Unit  (STU-III)  and  in  the  Navy’s  A-  anced 
Narrowband  Digital  Voice  Terminal  (ANDVT).  Even  slight  losses  in  quality  can  lower  the 
scores  on  intelligibility  tests  such  as  the  Diagnostic  Rhyme  Test  (DRT)  (Voiers,  1977), 
which  measures  the  discriminability  of  word  pairs  differing  only  in  a  single  distinctive 
feature.  Intelligibility  tests  indicate  that  although  scores  for  LPC-lOe  processed  speech  are 
lower  than  for  wideband  speech,  intelligibility  is  nevertheless  quite  good,  with  a  score  of 
89.8  on  the  DRT  (Sandy,  in  press)  and  98%  correct  recognition  of  the  words  of  the 
International  Civil  Aviation  Organization  (ICAO)  spelling  alphabet  and  digits  (Schmidt- 
Nielsen,  1987).  Scores  for  undegraded  speech  were  97.6  for  the  DRT  and  99%  correct  for 
the  spelling  alphabet.  For  LPC  with  5%  random  bit  errors,  the  DRT  score  falls  to  about  75 
and  only  about  90%  of  the  spelling  alphabet  words  are  correctly  understood.  Kallman  and 
Schmidt-Nielsen  (1989)  used  a  sentence  verification  task  to  evaluate  the  comprehension  of 
unprocessed  speech  and  of  LFC  processed  speech  with  and  without  random  bit  errors. 
Response  accuracy  decreased  and  reaction  time  increased  with  LPC  processing  and  with 
increasing  bit  error  rate  (ber). 

METHOD 

Tasks 

The  two  experimental  tasks  were  a  sentence  verification  task,  in  which  listeners 
decided  whether  a  spoken  sentence  was  true  or  false,  and  a  timed  card  sorting  task.  The 
listeners  were  seated  at  a  table  and  had  a  response  box  with  two  push  buttons  labelled 
TRUE  and  FALSE  by  the  preferred  hand.  A  master  card  with  pictures  of  24  common 
objects  was  placed  by  the  nonpreferred  hand,  with  an  oblong  box  beside  it  containing  small 
cards  with  pictures  of  individual  objects.  For  the  sentence  verification  task,  the  listeners 
heard  28  sentences  over  high  quality  headphones  and  used  the  index  and  middle  fingers  of 
the  preferred  hand  to  press  the  labelled  push  buttons.  They  were  to  decide  whether  each 
sentence  was  true  or  false  and  to  push  the  appropriate  button  as  quickly  as  possible  while 
avoiding  mistakes.  The  card  sorting  task  was  timed  to  last  barely  longer  than  the  sentence 
presentation,  so  that  in  the  dual  task  condition,  sorting  began  just  before  the  first  sentence 
and  ended  just  after  the  last  one.  The  words  "Start"  and  "Stop"  spoken  in  a  female  voice 
and  heard  over  the  headphones  were  used  to  cue  the  subjects  when  to  begin  and  end  the 
sorting  task.  Subjects  used  the  nonpreferred  hand  to  pick  up  an  object  card  one  at  a  time 
from  the  box,  turn  it  over,  and  place  it  on  the  corresponding  picture  on  the  master  card. 
The  object  was  to  sort  as  many  cards  as  possible  during  the  time  interval  between  the 
words  "Start"  and  "Stop." 

Speech  Materials 

The  speech  materials  were  the  same  as  those  used  by  Kallman  and  Schmidt-Nielsen 
(1989).  The  test  items  consisted  of  96  true  and  96  false  sentences.  Sixty  additional 
sentences  were  used  for  practice  and  for  fillers.  The  practice  list  and  the  eight  test  lists  had 
28  items  each.  The  first  4  items  (2  true  and  2  false)  in  each  test  list  were  fillers  and  were 
not  scored.  The  remaining  24  items  in  each  list  were  the  test  sentences  consisting  of  equal 
numbers  of  true  and  false  statements  equally  distributed  across  strong  and  weak  property 
and  category  relationships.  The  lists  were  recorded  by  a  male  speaker  with  approximately 
2  sec  of  silence  between  sentences. 

The  test  tapes  were  identical  to  those  used  by  Kallman  and  Schmidt-Nielsen  (1989). 
Four  speech  conditions  were  tested:  high  quality  unprocessed  speech  and  LPC  processed 
speech  with  0%,  2%,  and  5%  random  bit  errors.  Four  counterbalanced  sequences  of  eight 
lists  each  were  prepared.  The  order  of  the  four  processing  conditions  was  balanced  across 
sequences,  and  the  order  in  which  the  different  processing  conditions  were  presented  in  the 


second  half  of  each  sequence  was  the  reverse  of  the  order  in  the  first  half  to  control  for 
possible  practice  and/or  fatigue  effects. 

Sorting  Task  Materials 

The  stimuli  for  the  card  sorting  task  were  90  black  and  white  drawings  of  common  objects. 
Arbitrarily  selected  subsets  of  24  pictures  were  used  to  generate  15  different  master  card 
layouts  (4  rows  by  6  columns),  such  that  no  two  master  cards  had  more  than  six  pictures  in 
common.  For  each  master  card,  a  stack  of  object  cards  was  made  with  multiple  copies  of 
each  of  the  24  pictures  from  that  master  card.  A  different  master  card  was  used  for  each 
trial,  and  the  object  cards  weie  shuffled  before  each  trial  and  placed  in  a  small  rectangular 
box  next  to  the  subject,  with  the  picture  side  of  the  cards  facing  away  from  the  subject. 

Subjects  and  Procedure 

The  listeners  were  48  undergraduate  psychology  students  at  the  University  of 
Maryland  (12  for  each  of  the  four  sequences)  who  volunteered  to  participate  for  extra 
course  credit.  The  listeners  were  tested  individually.  Each  listener  was  first  familiarized 
with  the  sound  of  LPC  speech  by  listening  to  LPC  processed  versions  of  five  different 
speakers  each  reading  the  same  30  sec  paragraph.  This  was  followed  by  a  practice  trial  on 
the  sentence  verification  task,  in  which  the  subjects  heard  the  practice  list  in  a  fairly  difficult 
speech  condition,  LPC  processed  speech  with  2%  bit  errors.  After  practice  on  the 
verification  task,  subjects  were  given  two  practice  trials  on  the  sorting  task  alone.  The  nnai 
practice  trial  combined  the  two  tasks,  again  using  LPC  processed  speech  with  2%  bit  errors 
Each  subject  then  completed  one  of  the  counterbalanced  sequences  of  eight  dual-task  test 
lists,  with  a  5-10  minute  break  between  the  first  and  second  half  of  testing.  At  the  end 
there  was  a  single  trial  of  card  sorting  alone. 

Scoring  Procedure 

The  responses  and  reaction  times  to  the  sentence  verification  task  were  collected  and 
stored  by  an  IBM  PC  computer.  The  reaction  times  were  calculated  from  the  end  of  the  last 
word  of  each  sentence  as  determined  by  visual  inspection  of  the  digitized  waveform.  The 
number  of  cards  correctly  sorted  was  divided  by  the  time  from  start  to  stop  to  obtain  the 
number  of  correct  sorts  per  minute  for  each  trial. 

RESULTS 

Separate  analyses  of  variance  were  carried  out  for  each  of  the  dependent  measures: 
reaction  time  and  percentage  correct  responses  for  the  sentence  verification  task  and  number 
of  pictures  sorted  per  minute  for  the  picture  sorting  task.  The  degrees  of  freedom  for  the  F 
tests  were  corrected,  where  appropriate,  for  violations  of  sphericity  using  the  Huynh  and 
Feldt  (1976)  correction. 

The  effect  of  speech  processing  condition  on  the  sentence  verification  and  sorting 
tasks  is  shown  in  Table  1.  As  expected,  reaction  times  and  errors  for  the  sentence 
verification  task  increased  as  the  intelligibility  of  the  speech  conditions  decreased;  E>  (2.95, 
138.87)  =  63.30,  £  <  0.0001,  M£e  =  62,434,  and  E,  (2.74,  128.93)  =  111.95,  £  < 
0.0001,  MSe  =  154.17,  for  RTs  and  errors  respectively.  In  addition  to  affecting  the 
verification  task,  increasing  the  difficulty  of  the  speech  conditions  also  led  to  decreased 
performance  on  the  picture  sorting  task,  E,  (2.99,  140.35)  =  20.69,  £  <  0.0001,  MSe  = 
3.49.  This  suggests  that  the  increased  effort  required  to  understand  the  more  difficult 
speech  left  less  processing  capacity  available  for  performing  the  sorting  task. 
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Table  1.  The  effect  of  speech  processing  condition  on  mean  reaction  time 
and  percentage  errors  for  the  sentence  verification  task  and  on  number  of 
pictures  correctly  sorted  for  the  sorting  task. 


Condition 


Verification  Task  Card  Sorting 

Mean  RT  %  Errors  Sorts  /  minute 


Unprocessed 

551.0 

5.7 

23.07 

LPC,  0%  ber 

646.5 

9.5 

22.87 

LPC,  2%  ber 

701.9 

13.2 

22.39 

LPC,  5%  ber 

793.3 

21.4 

21.14 

Figure  1  compares  dual  task  performance  with  single  task  performance.  Single  task 
performance  for  the  picture  sorting  task  was  determined  by  averaging  the  second  single 
task  practice  sort  before  dual  task  testing  with  the  final  single  task  sort  after  the  dual  task 
testing.  It  can  be  seen  that  there  was  a  decrease  in  overall  performance  from  the  single  task 
to  the  dual  task  condition  as  well  as  an  additional  decrease  due  to  decreasing  speech  quality 
on  the  verification  task.  Single  task  data  on  the  sentence  verification  task  were  not  obtained 
in  this  experiment  because  of  time  constraints.  For  comparison  purposes,  the  data  from  the 
earlier  single  task  sentence  verification  study  by  Kallman  and  Schmidt-Nielsen  (1989)  are 
replotted  here.  When  we  compared  mean  RTs  and  error  rates  on  the  practice  trial  for  the 
two  experiments,  they  were  virtually  identical.  This  suggests  that  the  two  experiments 
were  reasonably  comparable.  A  comparison  of  the  sentence  verification  results  of  the 
present  experiment  with  those  from  the  previous  experiment  suggests  that  adding  a  sorting 
task  had  no  effect  on  overall  error  rates  but  did  increase  the  response  time  to  the  sentence 
verification  task. 

The  data  analyses  also  showed  some  effects  of  practice  in  the  first  half  vs.  second 
half  comparisons.  The  percentage  of  verification  errors  decreased  from  13.4%  to  11.5%, 
E,  (1,47)  =8.07,  p.  <  0.01,  MSe  =  161.99,  and  the  number  of  correct  sorts  per  minute 
increased  from  21.5  to  23.3,  E,  (1,47)  =  58.73,  p  <  0.001,  MSe  =  4.54,  but  there  was  no 
improvement  in  RTs,  £  (1,47)  =  1.20,  MSe  =  62.434.  The  reduction  in  the  error  rate 
suggests  that  practice  with  the  LPC  processed  speech  made  it  easier  to  understand,  and  this 
agrees  with  previous  research  on  LPC  speech,  which  shows  rapid  improvement  with 
practice  over  the  first  few  trials  (e.g.,  Schmidt-Nielsen  &  Everett,  1982).  The 
improvement  on  the  sorting  task  seems  to  be  largely  attributable  to  a  gradual  overall 
increase  in  performance  over  trials. 

DISCUSSION 

In  comparison  with  single  task  performance,  there  was  a  decrement  in  performance  on  card 
sorting  task  in  the  dual  task  condition,  and  the  number  of  correct  sorts  decreased  further 
with  increasing  speech  degradation  .  The  fact  that  performance  on  the  card  sorting  task 
decreased  from  the  single  task  to  the  dual  task  condition  is  not  surprising.  The  additional 
decrease  in  performance  with  increasing  difficulty  of  the  speech  task  suggests  that  over  and 
above  the  expected  dual  task  decrement,  the  extra  effort  required  to  understand  the 
degraded  speech  further  decreases  the  mental  processing  capacity  available  for  performing 
the  sorting  task. 
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Figure  1.  Comparison  of  single  task  and  dual  task  performance  for  the  sentence 
verification  task:  mean  RT  (top  panel),  percentage  errors  (middle  panel),  and  number  of 
correct  sorts  per  minute  for  the  picture  sorting  task  (bottom  panel). 
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The  comparison  between  single  task  and  dual  task  performance  for  the  verification 
task  involves  comparing  the  present  results  with  the  results  of  a  previous  experiment,  so 
any  conclusions  should  be  considered  somewhat  tentative  even  though  the  two  subject 
groups  were  very  similar  in  performance.  For  both  single  task  and  dual  task  conditions, 
die  percentage  of  verification  errors  increased  with  the  less  intelligible  speech  conditions,  as 
would  be  expected,  but  there  were  no  more  errors  under  dual  task  conditions  than  when  the 
verification  task  was  performed  alone.  The  addition  of  the  sorting  task  did  not  affect 
overall  comprehension  of  the  sentences.  There  was,  however,  an  increase  in  RT  under 
dual  task  conditions.  It  appears  that  the  increase  in  RT  for  the  sentence  verification  task 
was  fairly  constant  across  processing  conditions,  which  is  to  be  expected  since  the  sorting 
task  did  not  vary  in  difficulty. 

Both  tasks  have  a  cognitive  component  and  both  require  a  motor  output,  so  some 
competition  for  resources  is  to  be  expected  and  coordination  between  the  motor  outputs  is 
required.  One  can  expect  some  output  interference  between  the  two  tasks  since  both  the 
sorting  task  and  the  button  press  for  the  verification  task  required  a  motor  response.  The 
act  of  picking  up  and  putting  down  cards  with  one  hand  is  likely  to  lead  to  somewhat 
longer  RTs  to  the  button  push.  The  act  of  pressing  the  Hutton  might  also  result  in  slightly 
fewer  pictures  being  sorted  if  the  mechanical  act  of  pushing  the  button  with  one  hand  slows 
down  the  picking  up  and  putting  down  of  cards  with  the  other  hand,  but  that  effect  should 
be  small.  The  mental  processing  for  the  sorting  task  involved  identifying  the  picture  and 
locating  the  correct  match  on  the  layout.  The  mental  processing  for  the  verification  task 
involved  comprehension  of  the  sentence  and  deciding  whether  the  statement  was  true  or 
false.  The  additional  effort  required  to  understand  the  speech  in  the  degraded  conditions 
led  to  a  further  decrease  in  the  card  sorting  beyond  that  caused  by  the  dual  task  situation 
alone.  Thus,  it  would  appear  that  both  tasks  are  drawing  on  shared  resources  and  that  the 
extra  effort  required  for  the  speech  task  interferes  with  the  processing  of  the  visual  task  as 
well. 

SUMMARY  AND  CONCLUSIONS 

This  experiment  investigated  the  effects  of  reduced  speech  intelligibility  on  dual  task 
performance  using  a  sentence  verification  task  and  a  picture  sorting  task.  The  speech 
degradation  was  accomplished  using  a  narrowband  digital  voice  transmission  system  (LPC 
at  2400  bits/s)  operating  with  and  without  random  bit  errors.  Reaction  times  and  errors  for 
the  sentence  verification  task  increased  as  the  intelligibility  of  the  speech  conditions 
decreased.  The  addition  of  the  sorting  task  did  not  affect  overall  comprehension  of  the 
sentences,  i.e.,  there  were  no  more  errors  for  the  dual  task  conditions  than  for  the 
corresponding  single  task  conditions.  Reaction  time  for  the  speech  task  increased  with  the 
concurrent  sorting  task,  but  the  increase  was  constant  across  speech  conditions,  which  is  to 
be  expected  since  the  sorting  task  did  not  vary  in  difficulty.  There  was  a  significant  dual 
task  decrement  on  the  sorting  task,  and  in  addition,  there  was  an  further  decrease  in  sorts 
per  minute  as  the  speech  was  increasingly  degraded.  The  additional  decrease  in 
performance  on  the  sorting  task  with  increasing  difficulty  of  the  speech  task  suggests  that 
over  and  above  the  expected  dual  task  decrement,  the  extra  effort  required  to  understand  the 
degraded  speech  further  decreased  the  mental  processing  capacity  available  for  performing 
the  sorting  task.  It  would  appear  that  both  tasks  are  drawing  on  shared  resources  and  that 
the  extra  effort  required  for  the  speech  task  interferes  with  the  processing  of  the  visual  task 
as  well. 
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